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Abstract. QCD is an extensively developed and tested gauge theory, which models the strong 
interactions in the high-energy regime. In this talk, I shall review the considerable progress which 
has been achieved in the last few years in the most actively studied QCD topics: Monte Carlo models, 
higher-order corrections, and parton distribution functions. Thanks to that, QCD in the high-energy 
regime is becoming more and more an essential precision toolkit to analyse Higgs and New Physics 
scenarios at the LHC. 
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1. Introduction 

Quantum chromodynamics (QCD) is by now widely accepted as the theory which de- 
scribes the strong interactions between hadrons and their components, the quarks and 
gluons. From a theoretical point of view, it is a gauge field theory featuring asymptotic 
freedom, i.e. a coupling that grows weaker at smaller distances. However, the strong 
interactions also feature confinement, that is the lack of colour of the observed hadrons. 
Although in the non-perturbative (low Q 2 ) regime several clever approaches to QCD, like 
lattice gauge theory, Regge theory, chiral perturbation theory, large N c , are used, a com- 
plete theoretical solution to confinement is not yet available and is difficult to obtain, be- 
cause the QCD Lagrangian is formulated in terms of quarks and gluons, rather than the 
observed hadrons, and because at large distances, i.e. at low Q 2 , the coupling is strong. 
Conversely, at small distances, i.e. in the high-energy regime, the coupling is weak and 
thus it is possible to make use of the perturbative framework. In this talk I shall focus 
on the fact that perturbative QCD (pQCD), i.e. QCD in the weak-coupling (high Q 2 ) 
regime, has emerged as an essential precision toolkit for exploring Higgs and Beyond-the- 
Standard-Model (BSM) physics; and that is even more so at the Large Hadron Collider 
(LHC), because of the strongly interacting colliding protons. In the box of the precision 
toolkit, pQCD provides the tools for a precise determination of the strong coupling con- 
stant, a s , of the parton distribution functions (p.d.f.), of the electroweak parameters and of 
the LHC parton luminosity; and of the strong corrections to Higgs and BSM signals and to 
their backgrounds. 
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In any scattering process in high-energy QCD, the value of any observable can be ex- 
panded in principle as a series in a s . Thus a s represents the single most important piece of 
information we need. In the MS scheme and using next-to-next-to-leading-order (NNLO) 
results only, the 2004 world average [1] yields a s {M z ) = 0.1182 ± 0.0027. 

The fact that in the detectors experiments observe hadrons, while through the QCD La- 
grangian we can only compute the scattering between partons, calls for a framework where 
the short-distance physics, which is responsible for the primary scattering between partons, 
can be separated from the long-distance physics, which describes the parton densities in 
the initial state and the hadronisation in the final state. That framework is provided by the 
pQCD and factorisation. 

In the beginning there was the parton model and the Bjorken scaling, which constitute 
the backbone of pQCD. The latter computes the logarithmic scaling violations, that is the 
logarithmic corrections to the parton-model predictions. The pQCD tenets are the univer- 
sality of the infrared (IR) behaviour, the cancellation of the IR singularities for suitably 
defined variables, like jets and event shapes, and in the case of hadron-initiated processes, 
like electron-proton or (anti)proton-proton collisions, the factorisation of the short- and 
long-range interactions. 

Factorisation in proton-proton (pp) collisions states that the cross section for the pro- 
duction of high-mass states, like vector bosons, Higgs bosons, heavy quarks or large-£r 
jets, characterised by the large scale Q 2 , can be expressed as a convolution of short- and 
long-distance pieces, with the matching between the two pieces occurring at an arbitrary 
scale lip, called factorisation scale. The short-distance piece is the parton cross section 
for the primary event, describing how two partons out of the incoming protons collide to 
produce the high-mass states plus hard radiation; the long-distance pieces are the p.d.f.'s 
in the initial state (and eventually the fragmentation functions in the final state), describing 
the density of the colliding partons within the protons, and how that density changes with 
the parton virtuality and momentum fraction. The p.d.f.'s cannot be computed - they must 
be given by the experiment - but their dependence on \ip can. If factorisation holds, the 
parton cross section may be computed as a power series in a s . The only limitation is then 
computational, and the state of the art is that some production rates with one (and very few 
with two) final-state particles are known at next-to-next-to-leading order (NNLO), while 
many with two and some with three (and for special cases with four) final-state particles 
are known at next-to-leading order (NLO). A matching accuracy is then required from the 
p.d.f.'s whose DGLAP evolution has been recently computed to NNLO [2,3]. 

2. Breaking factorisation 

Factorisation, though, ignores altogether the underlying event (UE), which can be opera- 
tively defined as whatever is in the pp interaction besides the primary scattering. In partic- 
ular, the UE includes the multiple-parton interactions as well as the interaction of spectator 
partons, i.e. other than the ones initiating the primary scattering, the assumption being that 
if such interactions occur they are characterised by a scale A of the order of a GeV, and 
so they are suppressed by powers of A 2 /Q 2 with respect to the primary scattering. Thus, 
the UE breaks factorisation by means of power-suppressed contributions. How important 
are they for a precision calculation ? There is no obvious answer to this question since 
of course we cannot use the pQCD framework to model the UE, and its analysis must 
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rely solely upon the data. In pp collisions at the Tevatron, the UE is being studied [4] by 
analysing in single-jet production the charged-particle multiplicity in regions which are 
perpendicular in azimuth to the jet, since that region is expected to be sensitive to the UE. 
The UE sensitivity to beam remnants and to multiple interactions can be reduced by select- 
ing back-to-back two-jet topologies. A similar investigation is being planned also through 
the Drell-Yan production of vector bosons. 

Other examples of factorisation-breaking contributions are: a) the power corrections: 
Monte Carlo (MC) and theory modelling of power corrections were laid out and tested at 
LEP, where they also provided a determination of as [5]. However, those models still need 
be tested in hadron collisions: a study of single-jet production at the Tevatron running at 
two different centre-of-mass energies shows that the Bjorken scaling is violated more than 
logarithmically: data can fitted by assuming a power-correction shift in the jet Et [6]; b) 
diffractive events [7], which are known to violate factorisation at the Tevatron [8,9]. 




3. Monte Carlo models 



The detection of Higgs and BSM signals requires a precise modelling of their backgrounds. 
Examples are QCD production of W + 4 jets and of WW + 2 jets, which are backgrounds 
to Higgs production through vector-boson fusion (VBF) with the Higgs decay into a WW 
pair, as well to it production, or W + 6 jets and WW + 4 jets, which are backgrounds 
to Hit production. The huge amount of phase space available at the LHC, Fig. 1, as 
well as the large acceptance of the detectors, will make possible to produce final states 
with ten jets or more. One approach is to model QCD production through matrix-element 
MC generators, which provide an automatic computer generation of processes with many 
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jets, and/or vector or Higgs bosons. There are several such multi-purpose generators, 
like e.g. ALPGEN [11,12], ARIADNE [13], MADGRAPH/MADEVENT [14,15], COM- 
PHEP [16], GRACE/GR @ PPA [17,18], HELAC [19], and SHERPA [20] (which has got 
its own showering and hadronisation). A different example is PHASE/PHANTOM [21], 
a MC generator dedicated to processes with six final-state partons only - thus suitable to 
tt production, WW scattering, Higgs production via VBF and vector-boson gauge cou- 
pling studies, but where no approximation is used. Matrix-element MC generators are 
particularly suitable to studies which involve the geometry of the event, because the jets 
in the final state are generated at the matrix-element level, and thus exactly at any angle. 
In addition, they can be interfaced to parton-shower MC generators, like HERWIG [22] or 
PYTHIA [23], to include showering and hadronisation. In the context of matrix-element 
generators, the presence of several hard scales, like the mass of high-mass states, the jet 
transverse energy and the dijet invariant masses, makes an operative implementation of 
factorisation rather involved. The issue can be tackled through the CKKW [24,25] (or 
CKKW-like [26]) procedure. Within a given jet cross section, CKKW interfaces parton 
subprocesses with a different number of final states to parton showers. 

Finally, MC@NLO [27]: a procedure and a code to match exact NLO computations to 
shower MC generators. In a way, this is the most desirable procedure, because it embodies 
the precision of NLO parton calculations in predicting the overall normalisation of the 
event, while generating a realistic event set up through showering and hadronisation. It 
cannot be, though, multi-purpose, being obviously limited to the processes for which the 
NLO corrections are known. 



4. NLO calculations 

Another approach to the evaluation of the parton cross section a is through fixed-order 
computations. These yield only a limited access to the final-state structure, but have the 
advantage that higher-order corrections, real and virtual, can be included exactly. The vir- 
tual corrections will depend on a fictitious scale \i R , at which the scattering amplitudes 
are renormalised to take care of the ultraviolet divergences. NLO calculations have several 
desirable features, a) the jet structure: while in a leading-order calculation the jets have a 
trivial structure because each parton becomes a jet, to NLO the final-state collinear radia- 
tion allows up to two partons to enter a jet; b) a more refined p.d.f. evolution through the 
initial-state collinear radiation; c) the opening of new channels, through the inclusion of 
parton sub-processes which are not allowed to leading order; d) a reduced sensitivity to the 
fictitious input scales and /i p allows to predict the normalisation of physical observ- 
ables, which is usually not accurate to leading order. That is the first step toward precision 
measurements in general, and in particular toward an accurate estimate of signal and back- 
ground for Higgs and New Physics at LHC; e) finally, the matching with a parton-shower 
MC generator, like MC@NLO, as mentioned in Sect. 3. . 

Here we remind briefly how a NLO calculation is organised. Let us consider the produc- 
tion of n jets in hadron collisions. There are two types of contributions to a: the tree-level 
production with n + 1 final-state partons, with one of the partons that is undetected, and 
the one-loop production with n final-state partons. Schematically, 




(1) 
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where da B is the Born cross section, and 



a NLO = f da R + [ da v . 



(2) 



J n+l J n 



Both real and virtual contributions to Eq. (2), contain IR, i.e. collinear and soft, singu- 
larities. If in order to regulate those divergences one uses the dimensional regularisation, 
which fixes the dimensions of space-time to be d = 4 — 2e, then one finds that both terms 
on the right-hand side of Eq. (2) are divergent at d = 4. However, the structure of QCD 
is such that those singularities are universal, i.e. they do not depend on the process under 
consideration, but only on the partons involved in generating the singularity. Thus, in the 
90's process-independent procedures were devised to regulate those divergences. They are 
conventionally called slicing [28,29], subtraction [30,31] and dipole subtraction [32], and 
use universal counterterms to subtract the divergences. The NLO contribution, Eq. (2), can 
be written as, 



such that both sums of bracketed terms on the right-hand side of Eq. (3) are finite at d = 4, 
and thus readily integrable numerically via a computer code, with arbitrary selection cuts 
on the final-state particles and jets, as eventually required by a detector simulation. The 
organisation of NLO computations in process-independent procedures has made them an 
essential tool in the comparison with the experimental data. 

Let us look briefly at the history of NLO calculations: the first final-state distribution 
to NLO was computed for e + e~ — > 3 jets [33]. The addition of just one more jet in the 
final state, to produce e+e~ — ► 4 jets, took about 15 years [34,35]. This trend, namely 
the great difficulty in adding one more jet to a given final-state distribution, is repeated 
in all the other NLO calculations: in Drell-Yan with one associated jet [29], and with two 
associated jets [36]; in one- or two-jet production in hadron collisions [29,37], and in three- 
jet production [38,39]; in di-photon production in hadron collisions [40,41], and in the 
same with one associated jet [42]. Conversely, for other distributions in hadron collisions, 
like heavy-quark pair production [43], vector-boson pair production (including the spin 
correlations) [44,45], Drell-Yan with a heavy-quark pair [36], Higgs production via gluon 
fusion with one associated jet [46,47], Higgs production via vector-boson fusion with two 
associated jets [48], Higgs production with a heavy-quark pair [49,50], the addition of 
just one more jet has not been achieved yet to NLO. Furthermore, all the hadron-initiated 
distributions above have no more than three final-state particles (only very special cases 
with four final-state particles, like the NLO corrections to the electroweak production of a 
vector-boson pair + 2 jets [51], are known). 

Why in a NLO calculation is it so difficult to add more particles in the final state ? The 
loop integrals occurring in the virtual contributions to Eq. (2) are involved and process 
dependent. In addition, more final-state particles imply more scales in the process, and 
so lenghtier analytic expressions in the loop integral. In fact, the only known complete 
one-loop amplitudes with hexagon loops are the ones for six gluons [104,53-59] and for 
the electroweak corrections to e + e~ — ► 4 fermions [60]; with heptagon loops are the ones 
for seven gluons in N=4 super-yang-Mills theory [61]. Recently, a twistor-inspired ap- 
proach [62], which has allowed for great advances in the analytic computation of tree and 
one-loop amplitudes [63-66], as well as several semi-numerical approaches which show 




(3) 
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promise to handle NLO corrections in an automated way [67-75], have appeared. How- 
ever, the programme of applying sistematically NLO computations to studies of signals 
and backgrounds for Higgs and New Physics is still in its infancy and will be undoubtedly 
receive a lot of attention in the next future. 



5. NNLO calculations 

Are NLO computations accurate enough to describe the data ? The answer to that question 
is of course process dependent. Here I shall give a few examples: 

• b production at the Tevatron It has been long thought that the CDF data for b quark 
production were not in agreement with the NLO prediction (for a historical overview, 
see Ref. [76]). However, in the comparison of the CDF Run II data [77] for the 
J 1 1\) momentum distribution in inclusive B — ► J/tp + X decays to the NLO pre- 
diction [78] and to MC@NLO [79], one finds that the data lie within the theory 
uncertainty band and are in good agreement with the theory predictions. 

• W production at the LHC The Drell-Yan W cross section, with leptonic decay of the 
W boson, has been proposed as a luminosity monitor of the LHC [80], warranting 
a greater accuracy, of the order of a few percent, than the standard determination of 
the luminosity through the total hadronic cross section. However, the experimental 
W cross section depends on the acceptance, i.e. the fraction of events which pass the 
selection cuts. Thus, the accuracy of the luminosity monitor, the standard candle, 
depends on the one of the acceptance, which is related to the precision by which the 
hard cross section is known. In Ref. [81] the W cross section has been computed 
to different accuracies: to leading order, the same + HERWIG, NLO, MC@NLO, 
with or without including the spin correlations between the decay leptons and the 
partons entering the hard scattering. It was found that the difference between the 
NLO calculation and MC@NLO is about 2 — 3%, which is much less than the dif- 
ference between the same calculations with and without spin correlations. There- 
fore, to whatever accuracy we may compute the W cross section, if we want to use 
it as a standard candle it is mandatory to include the spin correlations. Recently, a 
calculation of the NNLO corrections, including the spin correlations, has been com- 
pleted [82]. It shows that the NNLO corrections differ from the NLO corrections 
more than 2 — 3% only if severe acceptance cuts on the px of the outgoing electron 
are used, which restrict drastically the available phase space. 

• Higgs production at the LHC At hadron colliders, the leading production mode for 
the Higgs is via gluon-gluon fusion through the mediation of a heavy-quark (mostly 
top-quark) loop. The NLO corrections to fully inclusive Higgs production via gluon- 
gluon fusion, including the heavy-quark mass dependence, required an evaluation 
at two-loop accuracy, and were found to be as large [83,84] as the leading-order 
calculation. That situation was unsatisfactory, because it called for a calculation to 
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NNLO 1 , which requires an evaluation at three-loop accuracy. However, in the large- 
ly limit, i.e. when the Higgs mass is smaller than the threshold for the creation of a 
top-quark pair, m H < 2nit, the coupling of the Higgs to the gluons via a top-quark 
loop can be replaced by an effective coupling. That reduces the number of loops in 
a given diagram by one. The NNLO corrections have been evaluated in the large- 
mi limit [86-88] and display a modest increase with respect to the NLO evaluation, 
showing that the calculation stabilises to NNLO. 

In the examples above I stressed how the central value of a prediction may change when 
going from leading order to NLO and eventually to NNLO. However, a benefit of going 
from leading order to NLO and then to NNLO is the reduction of the theory uncertainty 
band, due to the lesser sensitivity to the fictitious input scales [ir and [if of the calculation. 
Thus, a lot of theoretical activity has been directed in the last years toward the calculation 
of cross sections to NNLO accuracy. The total cross section [86,89] and the rapidity dis- 
tribution [90,91] for Drell-Yan W, Z production are known to NNLO accuracy. So are 
the total cross section [86-88] and the rapidity distribution [85] for Higgs production via 
gluon-gluon fusion, in the large-m t limit. However, only the calculation of Ref. [85], 
which has been extended to include the di-photon background [93], and Ref. [82] allow 
the use of arbitrary selection cuts. 

Basically, there are three ways of computing NNLO corrections: 

• Analytic integration, which is the first method to have been used [89], and may in- 
clude a limited class of acceptance cuts by modelling cuts as "propagators" [90,92]. 
Besides total cross sections, it has been used to produce the NNLO differential rates 
of Ref. [91]. 

• Sector decomposition, which is flexible enough to include any acceptance 
cuts [94-97], and has been used to produce the NNLO differential rates of 
Refs. [82,85,93] and of e + e~ — > 2 jets [98]. The cancellation of the IR divergences 
is performed numerically. 

• Subtraction, for which the cancellation of the divergences is organised in a process- 
independent way by exploiting the universal structure of the IR divergences, 
in particular the universal structure of the three-parton tree-level splitting func- 
tions [99-103] and of the two-parton one-loop splitting functions [104-108]. Al- 
though, the universal splitting functions have been known for some time, the cancel- 
lation of the IR divergences to NNLO is very intricate [109-1 17], and except for test 
cases like e+e~ — > 2 jets [112,117] and for parts of e+e~ — + 3 jets [116], no NNLO 
numerical code has been devised yet. 



x One must keep in mind that the calculation of Ref. [83,84] is fully inclusive, thus for an ideal 
detector with a 4ir coverage. If selection cuts are applied, like in Ref. [85], where Higgs production 
via gluon-gluon fusion is computed to NLO and to NNLO with a jet veto, the higher-order correc- 
tions may be not so large as in the fully inclusive calculation. Thus the ultimate judgement on the 
usefulness of a NNLO evaluation rests on an analysis with the cuts which will be used in the realistic 
simulations of the ATLAS and CMS detectors. 
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6. The parton distribution functions 

As outlined in the Introduction, at hadron colliders the theory cross section can be writ- 
ten using factorisation as a convolution of the parton cross section with the p.d.f.'s. The 
dependence of the p.d.f.'s on Q 2 is given by the DGLAP evolution equations. In those 
equations, the evolution in Q 2 is driven by the splitting functions, which are perturbatively 
computable. By consistency, in the factorisation formula the parton cross section and the 
splitting functions must be determined to the same accuracy. The leading-order [1 18,1 19] 
and NLO [120] splitting functions have been known for a long time. The calculation of 
the NNLO splitting functions has been completed recently [2,3], setting the record as the 
toughest calculation ever performed in perturbative QCD: it took the equivalent of 20 man- 
years, and about a million lines of dedicated algebra code. The p.d.f.'s obtained by global 
fits [121] 2 of all accessible collider and fixed-target data can be evolved to the large kine- 
matic range accessible through the LHC. In global fits, the fit is performed by minimising 
the x 2 to a ll the data. The evolution is started at some value Qq, where the p.d.f. is some 
suitable function of x. In addition, to avoid higher-twist contaminations, the data are se- 
lected above a certain momentum transfer and energy, Q 2 > and W 2 > W^ in . 
Recently, though, also an evaluation of the A% 2 , i.e. of the uncertainties arising from the 
errors on the experimental data, has been performed [122-126], using either the Hessian 
or the Lagrange-multiplier methods 3 . 

7. Conclusions 

QCD in the high-energy regime is constantly making progress. Here I have reviewed the 
considerable advances achieved in the last few years in the sectors of QCD which are most 
actively studied: Monte Carlo models, higher-order corrections and p.d.f.'s. More progress 
can be anticipated in modelling the underlying event, in improving the factorisation picture 
in Monte Carlo models through the use of CKKW-like procedures, in the automatisation 
of NLO calculations, and in a working NNLO numerical code based on subtraction, thus 
making high-energy QCD more and more an essential precision toolkit to analyse Higgs 
and New Physics scenarios at the LHC. 
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2 In Ref. [121], which pre-dates Refs. [2,3], the NNLO global fit is based on a few NNLO fixed 
moments, which were known at that time. 

3 Accordingly, in connection with the use of W, Z production as a parton luminosity monitor men- 
tioned in Sect. 5., Ref. [126] estimates a 4% uncertainty on the Drell-Yan W, Z production cross 
section. 
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